Search CORE

2 research outputs found

Mocarabe: High-Performance Time-Multiplexed Overlays for FPGAs

Author: Mellat Alireza
Publication venue: 'University of Waterloo'
Publication date: 24/01/2022
Field of study

Coarse-grained reconfigurable array (CGRA) overlays can improve dataflow kernel throughput by an order of magnitude over Vivado HLS on Xilinx Alveo U280. This is possible with a combination of carefully floorplanned high-frequency (645 - 768 MHz Torus, 788 - 856 MHz Mesh, 583 - 746 MHz BFT) design and a scalable, communication-aware compiler. Our CGRA architecture supports configurable Processing Element (PE) functionality supported by a configurable number of communication channels to match application demands. Compared to recent FPGA overlays like 4×4 ADRES and HyCUBE implementations in CGRA-ME, our design operates at a faster clock frequency by up to 3.4×, while scaling to an orders-of-magnitude larger array size of 19×69 on Xilinx Alveo U280. We propose a novel topology agnostic ILP placer that formulates the CGRA placement problem into an ILP problem. Our ILP placer optimizes placement regardless of topology and even for non-linear objective functions by using pre-computed placement costs as inputs to the ILP problem formulation. Using the ILP placer reduces placement quadratic wirelength up to 37% compared to the commonly used simulated annealing approach but increases runtime from less than a minute to hours. Our communication-aware compiler targets HLS objectives such as initiation interval (II) and minimizes communication cost using an integer linear programming (ILP) formulation. Unlike SDC schedulers in FPGA HLS tools, we treat data movement as a first-class citizen by encoding the space and time resources of the communication network in the ILP formulation. Given the same constraints on operational resources as Vivado HLS, we can retain our target II and achieve up to 9.2× higher frequency. We compare Torus and Mesh topologies, and show Mesh has less latency per area compared to Torus for the same benchmarks

University of Waterloo's Institutional Repository

On Predicting the Ultimate Capacity of a Large-Span Soil–Steel Composite Bridge

Author: A Wadi
A Wadi
A Wadi
AASHTO
B Kunecki
B Taleb
BM Das
C Regier
CSA Canadian Standards Association
CSA Canadian Standards Association
D Beben
D Beben
D Beben
D Beben
D Beben
D Beben
D Becerril García
D-H Choi
E Bayoǧlu Flener
E Bayoǧlu Flener
E Bayoǧlu Flener
E Bayoǧlu Flener
European Committee for Standardization
G Abdel-Sayed
H Mohammed
ID Moore
ID Moore
J Vaslestad
JE Bowles
JM Duncan
JM Duncan
K Klöppel
KM El-Sawy
KY Yeau
L Pettersson
L Pettersson
LH White
M El-Taher
MC Webb
P Mellat
RWI Brachman
S Alireza
S Alzabeebee
S Timothy
T Morrison
TM Elshimi
VT Mai
Y Girges
Y Liu
Y Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref